Combining Expression and Content in Domains for Dialog Managers 
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Abstract 

We present work in progress on abstracting dia- 
log managers from their domain in order to im- 
plement a dialog manager development tool which 
takes (among other data) a domain description as 
input and delivers a new dialog manager for the de- 
scribed domain as output. Thereby we will focus on 
two topics; firstly, the construction of domain de- 
scriptions with description logics and secondly, the 
interpretation of utterances in a given domain. 

1 Introduction 

Current research on dialog management is guided by two 
different ideas: firstly, to describe the discourse structure 
that the dialog manager is able to handle by a finite 
automaton using possible utterances as transitions (e.g. 
pof), and secondly, to view the detection of discourse 
structure as a parameter estimation problem and to use 
statistical models for the description of discourse struc- 
ture (e.g. @, §). 

Some serious problems remain with each of these ap- 
proaches: first of all, they do not integrate a user model. 
The finite state method imposes hard restrictions on 
"free dialog" , as far as vocabulary, syntax and order of 
utterances are concerned. The statistical approach, on 
the other hand, suffers from the sparse data problem and 
does not give theoretical insight into discourse theory. 

But as researchers point out (see e.g. ||), it is 

important to explore the structure of discourse and the 
interactions of dialog and user model in order to obtain 
robust dialog systems. Studying the effect of natural 
language expressions on the state of discourse this paper 
tries to do a step in this direction by discussing the use 
of description logics for defining dialog domains formally 
and by describing how inference is performed on the basis 
of such a framework. 

Following Hjelmslev |t]], Eco Q defines two essential 
ingredients for any semiotic system (and, in particular, 
any natural language): 



• Expressiveness: what is the vocabulary, phonetics, 
and syntax of the language considered? 

• Content: what knowledge can be expressed by a given 
system? How (i.e. by which expressions) is this knowl- 
edge organised in the semiotic system? 

Obviously, the key problem is how to connect content 
and expressions in order to capture the meaning of ex- 
pressions. Russell |1| proposes to divide the vocabulary 
into two classes: 

• object words: define basic notions (in a train informa- 
tion scenario e.g. train, time, or station) 

• dictionary words: can be defined using other words 
with already defined meaning (e.g. departure time or 
arrival station) 

Trivially, there is an obvious analogy between object 
words and primitive concepts in description logics and 
dictionary words and derived concepts. We exploit this 
analogy to describe the application domain for a dialog 
manager by means of a terminology in DL. Doing this, 
we can define the meaning of basic and derived notions 
that are relevant in the domain under consideration. 

2 A-DRT and DL 

A basic concept of any dialog system is a theory of dis- 
course. We use Discourse Representation Theory (DRT) 
as introduced by || to describe the discourse generated 
during a dialog. DRT has simple semantics: A discourse 
representation structure (DRS) K with 



K = 
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condi(di, g?2, . 
cond2 (di , g?2 , • 



can be mapped into the logic language capturing the 
syntax of condi(di, di, ...) by: 

3di,d2, ■■■ : condi(di, da, ■■■) A cond2 (d± , d 2 , ■■■) 



This mapping shows that a DRS essentially defines a set 
of assertions and thereby expresses extensional knowl- 
edge declared throughout the discourse. 

On the other hand, a terminology of a certain domain 
can describe intensional knowledge used while construct- 
ing DRS out of utterances: the linguistic parser has to 
verify constraints imposed by subcategorization. E.g. a 
train usually departs from a station, i.e., a city name is 
a valid object of Depart if there exists a railway sta- 
tion. To evaluate this constraint the parser makes use of 
DL reasoning services while constructing the semantics 
of the current utterance. 

In this sense the incorporation of DL into our approach 
to dialog theory is crucial to overcome some of the limita- 
tions^] of first order logic for describing natural language 
semantics. 

3 Partial Information in Dialogs 

A key issue for dialog management is the handling of par- 
tial information provoked by "incomplete" utterances. 
By this notion we mean the fact that in the shared knowl- 
edge of the participants there is not enough information 
available to, e.g., answer a question posed by the speaker. 
For the sake of illustration, let us consider the query 

When does a train leave to Rome? 

The information given in this query is (only) partial as 
it is incomplete or insufficient for the hearer to answer it. 
At least the station of departure is unknown and has to 
be asked for by the hearer if he wants to give some rea- 
sonable answer. Notably, the open world assumption of 
DL does not suffice to handle, because it does not allow 
to make an assumption about a formula (representing a 
query as above), not even by default. 

We use First Order Partial Information Ionic Logic 
(FIL), as introduced by Abdallah Q, to handle situa- 
tions of partial knowledge. FIL is a first order language 
whose interpretation function is partial, i.e. a formula 
can have a undefined truth value. Of course, interpreta- 
tion functions can be ordered partially by 

X < J dom(J) C dom(J) 
Vie G dom(I) : J(x) = X(x) 

In addition to first order formulae, FIL provides for- 
mulae (so called ionic formulae) of the form 

*({fa,...,fa},£). 

Given a partial interpretation X that satisfies a set of 
standard first order formulae S, the ionic formula above 

First order logic lacks a mechanism for constructing well- 
defined terms from utterances. Its model theory is inten- 
sionally and ontologically inadequate for natural language 
expressions. 



says that £ is true in an extension of X as long as there 
is no extension of X that assigns false to at least one fa. 

$ = {fa, ...,fa} is called justification set or justifica- 
tion context. Stating the semantics of a ionic formula 
more informally, we have that £ is true (by default) as 
long as there is no evidence to the contrary from justifi- 
cation context $0. 

4 Partial Information and Dialog Plans 

As outlined above, the main characteristic of dialogs 
is the fact that information is given stepwise in the 
course of several utterances. For the design of a domain- 
independent dialog manager it is therefore important to 
develop an interpretation algorithm for utterances that 
is able to interact with the user in order to collect the 
neccessary information in any order. 

Ionic formulae provide such a mechanism. Their jus- 
tification contexts allow for infering what information is 
still missing while interpreting the current utterance. On 
the other hand, a justification context can be seen as a 
set of default assumptions to be accepted or rejected by 
the user later on. 

From this point of view a justification context is the 
set of dialog goals to be fulfilled by the dialog manager 
in order to compute an answer to the user's question. 

5 Integrating FIL and DL 

To combine the advantages of DL as a concept language 
for domain modelling and FIL to describe partiality we 
integrate domain models into the FIL based reasoning. 

To do this, we have to characterize the part of a do- 
main model that can cause situations of partial informa- 
tion during a dialog. We state that any role that has the 
concept User as domain or range is a source of "missing 
information" as these roles describe the user's attitudes 
eventually to be clarified in several dialog steps. There- 
fore, we define a set UserRel as follows: 

UserRel = {R : dom(-R) C User V range(i?) C User} 

For a formal description of the connection of DL and 
FIL, we give a translation mapping r from DL to FIL 
which is sort of sensitive to UserRel defined above. The 

2 In fact, Jlj explains that an interpretation function for $ 
can assign 1 to <E> ($ is acceptable, written: + * $), or ($ 
is inacceptable: — *<!?), or not assign 1 ($ is not acceptable: 
+*$), or not assign ($ is not inacceptable: — *$). This 
is due to the fact that in a partial logic for a relation sym- 
bol R(xi, Xn) one has two sets R + and R~ to declare the 
semantics of R(x\, x n ): R(xi, x n ) is true if and only if 
(x\,x..., x n ) £ R + and false if and only if (x\,x..., x n ) £ R~ . 
R + U R~ do not exploit the universe IA. So one can de- 
scribe the four "positions" of (x\, x n ) relative to R + or 
R~ , respectively. 



mapping, of course, resembles Borgida's || and the one 
by [|l6) and is identical except of the names of roles: 

Ax, y. *({R(x,y)},R(x,y)), R G UserRel 
Xx,y.R(x,y), otherwise 



t(R) = 



This says that all role names of a given TBox marked 
as partial are translated as ionic formulae. All other 
symbols are translated to first order formulae as usual 
by structural induction on the syntax of the DL. For 
example, given roles R, S±, and 52 with R = S± LA S 2 , 
the definition of r is: 

t(R) = \x,y.r(Si)(x,y) V T(S 2 )(x,y) 

whereas for role R and concepts C\ and C 2 with C\ = 
V-R.C2 we have 

r(Cx) - Ax.Vy : r(R)(x,y) — » r(C 2 )(x) 

As it is shown by e.g. Borgida in the mapping pre- 
serves satisfiability of DL expressions that are translated 
into a FIL formula, as FIL is an extension of First Order 
Predicate Logic and the translation from FIL to FOPL 
only uses the FOPL "sublanguage" of FIL. I.e. if a for- 
mula has some model in DL, then it has one in FIL, too. 
On the other hand, if a FIL expression is satisfiable, then 
it is in DL, too, if no ionic formula is contained (see ||l6|). 

A ionic formula *(R(a, 0), R(a, (3)) is the translation 
of some role R E UserRel whose justification context 
R(a, (3) can have one of the following states of acceptance 

• + * R(a,/3) or — *R(a,f3): I.e. for any model M it 
is impossible that M\y= R{a,(3) <-> M |= ^R{a,(3). 
This observation implies that either M \= R(a, j3) 
or R(a, (3) is undefined. So |= r(a :: R : (3) = 
*(R(a, (3), R(a, (3)) does not contradict |= a :: R : /3. 

• — * R{a,(3) or ~+*R(a, (3): in this case, analogously, 
either M.\\/= R(a, (3) or R(a,(3) is undefined implying 
that a :: -*R : (3 is satisfiable. 

In any case, r preserves satisfiability depending on the 
state of acceptance of justification contexts. 

6 Discussion of an Example 

In the train information application that is considered 
throughout this paper, an appropriate terminology could 
include the following concepts and roles: 

• Train, Depart, Time, Station 

• At : Train x Time 

To : Train x ArrStation 
From : Train x DepStation 
DepartFrom : User x Station 



DepartFrom is in UserRel and therefore mapped as: 

Au, s.*({DcpartFrom(u, s)},DepartFrom(it, s)) 

Among many others we have the concepts 

DepStation = 3DepartFrom _1 .User 

n Station 
TrainFrom = BFrom. DepStation 
n Train n Depart 
TrainAtFrom = BAt.Time n TrainFrom 
TrainAtFromTo = 3To. Station 

n TrainAtFrom 

This terminology is translated to FIL follows (variables 
all-quantified) : 



DepStation(s) 
TrainFrom(t) 
TrainAtFrom(t) 
TrainAtFromTo(t) 



DepartFrom (s, u) 
AUser(u) A Station(s) (1) 
From(t, s) A DepStation(s) 
ATrain(i) A Depart (t) (2) 
At(i, d) A Timc(d) 
A TrainFrom(t) (3) 
To(i, s) A ArrStation(s) 
A TrainAtFrom(t) (4) 



On the basis of the terminology above the utterance 

When does a train depart to Rome? 
has the discourse representation structure (DRS) 

t Rome 



Ax. 



Train(t) 

Depart(t) 

Time(a;) 

ArrStation(Rome) 
At(t, x) 
To(t,Rome) 



This structure is built up relying on DL reason- 
ing: Rome is contained in the (linguistic lexicon) as 
CityName(Rome). In this example, the preposition to 
is mapped onto To as defined above. In order to com- 
bine train, to, and Rome, one has to check whether 
Rome e 3HasArrStation. Station which is evaluated 
by the data base to be true for Rome. This results in 
ArrStation(Rome). Generally, DRS are constructed by 
means of instance checking that expands type unifica- 
tion. 

To infer TrainFrom(t) on the basis of the information 
available from the utterance, it is necessary to determine 
the truth value of <j)(s) = As. DepartFrom -1 (s, u) (see eq. 
0; variable u is bound to constant u G User). A first 



order approach would answer false, because its interpre- 
tation function is total and there is no information in 
the DRS above in order to substitute a station name for 
s (making tj> true). But in FIL we have (see above): 

DepartFrom _1 (s, u) <^=> *({DepartFrom(it, s)}, 

DepartFrom(u, s)) 

Based on this rule, we can conclude immediately that 
(j)(s) be true iff £(s) = As.DepartFrom(u, s) is true unless 
there is information to the contrary (see Sect. ||). 

As £'s justification context still contains an unbound 
variable, the dialog manager interprets it as a question 
to be posed to the user (no default assumptions can be 
made). Therefore, the discourse plan will be updated 
and the dialog manager will react with 

Where do you depart to from? 

because s € Station. The answer 
From Milan. 

adds DepartFrom(u, Milan) to the shared knowledge so 
that the dialog manager can bind s to Milan. After 
that, we can infer by eq. || and eq. || TrainAtFromTo(t) 
substituting x by d and d by all constants c for whom 
At(t,c) A Time(c) is true. 

The dialog plan is based on a speech act model^j. 
Speech acts are determined by reasoning on the user's 
attitudes, grammatical information from, and coherence 
of utterances. 

7 Prom Notions to Vocabulary 

To make a dialog system understand the user it has to 
know how expressions in natural language are connected 
to the abstract notions of the domain modell. 

Natural languages normally offer the possibility to ex- 
press the same notion by different synonyms. For exam- 
ple, one can say: When does the train leave to Rome? 
or, alternatively When does the train depart to Rome? 
In both cases, the notion of train departure is expressed. 

When we inspected the EVAR|] corpus of train infor- 
mation dialogs, we could see that synonyms occur fre- 
quently, even in relatively simple domains as train in- 
formation. Our hypothesis is that in much larger more 
complicated domains that allow for a greater variabil- 
ity of expressions and vocabulary there are even more 
synonyms for a certain abstract notion. 

3 For information dialogs we assume as minimal the set 
{inform, query, suggest, accept, reject}. This set can be ex- 
tended to fit the needs of a certain application. See for 
details. 

4 EVAR is a publicly accessible information system on Ger- 
man Railway InterCity connections ( H ) . The existing corpus 
of more than 1100 annotated dialogs contains samples of "real 
world" data with "naive" users. 



To handle this phenomenon, we have to extend our 
domain description by adding formulae like 

Synf(x) V-.-VSyn£(aO =* C(x) 

for any concept C and and its synonymic expressions 
Syn^ (x) and, equally, 

Synf (x, y) V • • • V Syn^z, y) =^ R(x, y) 

for any role R and its synonyms Synf (x, y). 

For the two questions above we would introduce 

leave(cc) V depart(x) =>- Depart(:e) 

"mapping" the verbs leave and depart to Depart. 

8 Pragmatics of Concepts and Roles 

Except of constructing a linkage between a domain 
model in terms of DL and a language model for the given 
domain, a configurable dialog manager must define an 
interface between its logical domain model and an ar- 
bitrary (mostly non-logical) problem solving component 
for the domain. 

The reasoning mechanisms of the dialog manager and 
the problem solver, respectively, can be linked as follows: 
The problem solver evaluates relations between (i.e. roles 
of) discourse referents that are instantiations of concepts 
according to the previous utterances. 

In the example outlined above, t can be asserted 
TrainAtFromTo only if the query 

At(t, x) A From(t, Milan) A To(t, Rome) 

can be evaluated successfully (e.g. as a query to a 
database) and returns a list of connections from Milan to 
Rome. They can serve as data for continuing the dialog. 

In the way outlined we can define an interface between 
dialog management and the pragmatics of the applica- 
tion which is by nature independent of a specific do- 
main and therefore allows for abstraction of the dialog 
manager from its underlying domain-dependent problem 
solving component, linguistics, and discourse structure. 

9 Configuring a Dialog Manager 

For the practical purpose of adapting a dialog manager 
for a specific task it is of great importance to observe that 
dialogs consist of a domain-independent and domain- 
dependent utterances. 

Domain independence is related to establishing mu- 
tual understanding, dialog segmentation, and reference 
resolution. It is very important to consider these phe- 
nomena as they can be expressed in natural language: 
e.g. "Could you repeat that?" , "Pardon?" , "Next I want 
to ask you ..." . Not to take such expressions into account 



would result in poor understanding capabilities of the di- 
alog manager. Intentions, speech acts, and obligations 
can be expressed explicitly as well^. 

A model of these domain independent notions forms 
the basic capabilities of the dialog manager to engage in 
natural language conversation. To configure it for a cer- 
tain application, one has to expand the model describing 
application-specific (i.e. domain-dependent) notions by 

1. Adapting, extending, and specializing the given DL 
model so that it defines all notions of importance for 
the application. DL systems support the phase of de- 
signing a domain model by means of testing the sat- 
isfiability of terminologies. This is a major practi- 
cal advantage for ensuring the robustness of the dia- 
log manager compared to other approaches to domain 
modelling. 

2. Defining the interface between the problem solving 
component and the dialog manager. I.e. defining what 
concept and role symbols will be evaluated by what 
functions of the problem solver. 

10 Conclusions 

We have established a connection between DL and FIL 
in terms of satisfiability via a mapping between formulae 
of each language. The correctness of inferences is assured 
by FIL (see Q). This allows to conclude the correctness 
of reasoning with DRS as they can be mapped onto FIL 
formulae (see Q and ||). We consider the combination 
of DL's knowledge representation facilities and FIL's in- 
ference mechanism for partial knowledge a well-founded 
basis for utterance interpretation and the description of 
mixed initiative dialogs in order to implement the "core 
engine" of an adaptive, configurable, and domain inde- 
pendent dialog manager. In this way we can separate 
linguistics and discourse theory from the knowledge en- 
gineering task to describe the application domain. This 
task can be performed by an application expert even 
without deep knowledge of dialog managers. 

11 Future Research 

Evidence from a number of experiments shows that 
humans perform domain reasoning while incrementally 
matching the speaker's utterances with their own expec- 
tations (see e.g. [l8|). Following this line of research, 
we are studying the use of domain descriptions and DL 
reasoning services to model how dialog participants es- 
tablish mutual beliefs on the basis of utterances. 

Secondly, cooperating industrial partners we want to 
generate domain models for different domains. For the 

J By modifying this dialog model, one can influence the 
planning of the dialog manager (c.f. footnote in Sect. 0). 



acquisition of "synonym knowledge" we will apply learn- 
ing techniques from corpora of example dialogs collected 
by our partners. General work on DL learning (see e.g. 
0) as well as work on the combination of DL and linguis- 
tic processing (as done in |l7]]) will have to be considered. 

Finally, we are working on user-friendly tools to de- 
fine interfaces between domain models and its related 
problem solving components. 
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